vnNLI - VLSP2021: An Empirical Study on Vietnamese-English Natural Language Inference Based on Pretrained Language Models with Data Augmentation
Abstract
In this paper, we describe an empirical study of data augmentation techniques with various pre-trained language models on the bilingual dataset presented at the VLSP 2021 shared task on Vietnamese and English-Vietnamese Textual Entailment. We apply a machine translation tool to generate a new training set from the original one, then investigate and compare the effectiveness of a monolingual and a multilingual model on the new training set. Our experimental results show that fine-tuning XLM-R on the augmented training set gives the best performance. Our system ranked third in the shared task, with an F1-score of about 0.88.
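The translation-based augmentation described in the abstract can be sketched roughly as follows. The abstract does not name the translation tool or say which language pairings were generated, so this is only an illustration under assumptions: `NLIExample`, `augment_with_translation`, the `translate` callable, the language codes `"vi"` and `"en"`, and the label string are all hypothetical names, and the sketch simply produces every premise/hypothesis language combination while keeping the original label.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Sequence

# Hypothetical signature for a translation backend:
# (text, source_language, target_language) -> translated text.
Translator = Callable[[str, str, str], str]


@dataclass(frozen=True)
class NLIExample:
    """One premise/hypothesis pair with its label (illustrative schema)."""
    premise: str
    hypothesis: str
    label: str
    premise_lang: str
    hypothesis_lang: str


def augment_with_translation(
    examples: Iterable[NLIExample],
    translate: Translator,
    langs: Sequence[str] = ("vi", "en"),
) -> List[NLIExample]:
    """Return the originals plus translated variants, without duplicates.

    Assumption: translation preserves the entailment label, so each
    variant reuses the label of the example it was derived from.
    """
    originals = list(examples)
    augmented = list(originals)
    seen = set(originals)
    for ex in originals:
        for p_lang in langs:
            for h_lang in langs:
                # Translate a side only when the target language differs.
                premise = (
                    ex.premise
                    if p_lang == ex.premise_lang
                    else translate(ex.premise, ex.premise_lang, p_lang)
                )
                hypothesis = (
                    ex.hypothesis
                    if h_lang == ex.hypothesis_lang
                    else translate(ex.hypothesis, ex.hypothesis_lang, h_lang)
                )
                variant = NLIExample(premise, hypothesis, ex.label, p_lang, h_lang)
                if variant not in seen:
                    seen.add(variant)
                    augmented.append(variant)
    return augmented


def toy_translate(text: str, src: str, tgt: str) -> str:
    """Stand-in translator that only tags the text; not a real MT system."""
    return f"[{src}->{tgt}] {text}"
```

In practice `toy_translate` would be replaced by a call to whatever machine translation system is available, and the augmented list would then be passed to the fine-tuning pipeline of the chosen model.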
Similar resources
Effects of First Language on Second Language Writing: A Preliminary Contrastive Rhetoric Study of Farsi and English
To explore this idea, the proposed investigation aimed at finding whether the performances of a population of Iranian students studying English in an EFL context are consistent in L1 and L2 writing tasks and whether there is a cross-linguistic transfer in this respect. In this regard, the subjects were instructed to write four compositions, two in English and two in Farsi, which consisted of an ...
The Role of Task-Based Techniques on the Acquisition of English Language Structures by the Intermediate EFL Students
This study examines the effectiveness of task-based activities in helping students learn English language structures for better communication. Initially, a Michigan test was administered to the two groups of 52 students majoring in English at the Allameh Ghotb-e-Ravandi University to ensure their homogeneity. The students' scores on the grammar part of this test were also regarded as their pre...
An overview of empirical natural language processing.(Natural Language
In recent years, there has been a resurgence in research on empirical methods in natural language processing. These methods employ learning techniques to automatically extract linguistic knowledge from natural language corpora rather than require the system developer to manually encode the requisite knowledge. The current special issue reviews recent research in empirical methods in speech reco...
An Ontology Specification Language Based on a Controlled Natural Language
The specification of ontologies has always been a complex problem, especially due to the difficulty of acquiring knowledge from domain experts. This paper proposes an ontology specification language based on a Controlled Natural language to be used by domain experts in the specification of their own ontologies. The language was made in order to tolerate the use of linguistic mechanism like anap...
A Study on Inference Control in Natural Language Processing
Natural language processing requires flexible control of computation on various sorts of constraints such as syntax, semantics, pragmatics. This study aims to propose and verify a new approach that describes a system declaratively with constraints and controls inferences guided by general principles based on probability. This approach is an alternative one against the previous procedural approach...
Journal
Journal title: VNU Journal of Science: Computer Science and Communication Engineering
Year: 2022
ISSN: 2615-9260, 2588-1086
DOI: https://doi.org/10.25073/2588-1086/vnucsce.330